Abstract: We investigate the maximal achievable rate for
a given blocklength and error probability over quasi-static 
single-input multiple-output (SIMO) fading channels. Under mild 
conditions on the channel gains, it is shown that the channel 
dispersion is zero regardless of whether the fading realizations 
are available at the transmitter and/or the receiver. The result 
follows from computationally and analytically tractable converse 
and achievability bounds. Through numerical evaluation, we 
verify that zero dispersion indeed entails fast convergence to 
outage capacity as the blocklength increases. In the example of a 
particular 1x2 SIMO Rician channel, the blocklength required to 
achieve 90% of capacity is about an order of magnitude smaller 
compared to the blocklength required for an AWGN channel 
with the same capacity. 

I. Introduction 

We study the maximal achievable rate $R^*(n,\epsilon)$ for a given
blocklength $n$ and block error probability $\epsilon$ over a quasi-static
single-input multiple-output (SIMO) fading channel, i.e., a
random channel that remains constant during the transmission
of each codeword, subject to a per-codeword power constraint.
We consider two scenarios: 

i) perfect channel-state information (CSI) is available at
both the transmitter and the receiver;

ii) neither the transmitter nor the receiver has a priori CSI.

Footnote: Hereafter, we write CSIT and CSIR to denote the availability of perfect
CSI at the transmitter and at the receiver, respectively. The acronym CSIRT
will be used to denote the availability of both CSIR and CSIT.

For quasi-static fading channels, the Shannon capacity,
which is given by the limit of $R^*(n,\epsilon)$ for $n \to \infty$ and
$\epsilon \to 0$, is zero for many fading distributions of practical
interest (e.g., Rayleigh, Rician, and Nakagami fading). In this
case, the $\epsilon$-capacity [1] (also known as outage capacity), which
is obtained by letting $n \to \infty$ in $R^*(n,\epsilon)$ for a fixed $\epsilon > 0$,
is a more appropriate performance metric. The $\epsilon$-capacity of
quasi-static SIMO fading channels does not depend on whether
CSI is available at the receiver [2, p. 2632]. In fact, since the
channel stays constant during the transmission of a codeword,
it can be accurately estimated at the receiver through the
transmission of known training sequences with no rate penalty
as $n \to \infty$. Furthermore, in the limit $n \to \infty$ the per-codeword
power constraint renders CSIT ineffectual [3, Prop. 3], in
contrast to the situation where a long-term power constraint is
imposed [4].



Building upon classical asymptotic results of Dobrushin
and Strassen, it was recently shown by Polyanskiy, Poor, and
Verdú [5] that for various channels with positive Shannon
capacity $C$, the maximal achievable rate can be tightly
approximated by

$$R^*(n,\epsilon) = C - \sqrt{\frac{V}{n}}\, Q^{-1}(\epsilon) + O\!\left(\frac{\log n}{n}\right). \tag{1}$$

Here, $Q^{-1}(\cdot)$ denotes the inverse of the Gaussian $Q$-function
$Q(x) \triangleq \int_x^{\infty} \frac{1}{\sqrt{2\pi}} e^{-t^2/2}\, dt$ and $V$ is the channel dispersion [5,
Def. 1]. The approximation (1) implies that to sustain the
desired error probability $\epsilon$ at a finite blocklength $n$, one pays
a penalty on the rate (compared to the channel capacity)
that is proportional to $1/\sqrt{n}$. For the CSIR case, the
dispersion of single-input single-output AWGN channels with
stationary fading was derived in [6], and generalized to
block-memoryless fading channels in [7].

Contributions: We provide achievability and converse
bounds on $R^*(n,\epsilon)$ for a quasi-static SIMO fading channel.
The asymptotic analysis of these bounds shows that under mild
technical conditions on the distribution of the fading gains,

$$R^*(n,\epsilon) = C_\epsilon + O\!\left(\frac{\log n}{n}\right). \tag{2}$$

This result implies that for the quasi-static fading case, the
$1/\sqrt{n}$ rate penalty is absent. In other words, the $\epsilon$-dispersion
(see [5, Def. 2] or (33) below) of the quasi-static fading
channels is zero. This result turns out to hold regardless of
whether CSI is available at the transmitter and/or the receiver.

Numerical evidence suggests that, in some scenarios, the
absence of the $1/\sqrt{n}$ term in (2) implies fast convergence
to $C_\epsilon$ as $n$ increases. For example, for a 1 x 2 SIMO Rician-
fading channel with $C_\epsilon = 1$ bit/channel use and $\epsilon = 10^{-3}$, the
blocklength required to achieve 90% of $C_\epsilon$ is between 120 and
320, which is about an order of magnitude smaller than
the blocklength required for an AWGN channel with the
same capacity. In general, to estimate $R^*(n,\epsilon)$ accurately
for moderate $n$, an asymptotic characterization more precise
than (2) is required.
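As a rough illustration of the AWGN side of this comparison, the following Python sketch evaluates the normal approximation (1) and searches for the smallest blocklength at which it reaches 90% of capacity. Several ingredients are assumptions rather than statements from this paper: the complex AWGN dispersion $V = \rho(2+\rho)/(1+\rho)^2$ (the standard expression from the finite-blocklength literature), the replacement of the $O(\log n / n)$ term by $\log(n)/(2n)$, and all function names, which are purely illustrative.

```python
import math
from statistics import NormalDist


def q_inv(eps: float) -> float:
    """Inverse of the Gaussian Q-function, using Q(x) = 1 - Phi(x)."""
    return NormalDist().inv_cdf(1.0 - eps)


def awgn_normal_approx(n: int, rho: float, eps: float) -> float:
    """Normal approximation of R*(n, eps) in nats per channel use.

    Assumes a complex AWGN channel with SNR rho, capacity log(1 + rho)
    and dispersion rho * (2 + rho) / (1 + rho)**2 (an assumption, see text).
    The O(log n / n) term is replaced by log(n) / (2 n).
    """
    capacity = math.log1p(rho)
    dispersion = rho * (2.0 + rho) / (1.0 + rho) ** 2
    penalty = math.sqrt(dispersion / n) * q_inv(eps)
    return capacity - penalty + math.log(n) / (2.0 * n)


def blocklength_for_fraction(rho: float, eps: float, fraction: float = 0.9) -> int:
    """Smallest n at which the approximation reaches fraction * capacity."""
    target = fraction * math.log1p(rho)
    n = 1
    while awgn_normal_approx(n, rho, eps) < target:
        n += 1
    return n


if __name__ == "__main__":
    # rho = 1 corresponds to C = 1 bit per channel use; eps is illustrative.
    print(blocklength_for_fraction(rho=1.0, eps=1e-3))
```

Under these assumptions the search stops at a blocklength above one thousand, which is consistent in order of magnitude with the gap to the quasi-static case described above. The sketch says nothing about the fading channel itself, whose bounds require the tools developed in Section III.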

Our converse bound on $R^*(n,\epsilon)$ is based on the meta-
converse theorem [5, Thm. 26]. Application of standard
achievability bounds for the case of no CSI encounters
formidable technical and numerical difficulties. To circumvent
them, we apply the $\kappa\beta$ bound [5, Thm. 25] to a stochastically
degraded channel, whose choice is motivated by geometric
considerations. The main tool used to establish (2) is a
Cramér-Esseen-type central-limit theorem [8, Thm. VI.1].

Notation: Upper case letters such as $X$ denote scalar
random variables and their realizations are written in lower
case, e.g., $x$. We use boldface upper case letters to denote
random vectors, e.g., $\mathbf{X}$, and boldface lower case letters for
their realizations, e.g., $\mathbf{x}$. Upper case letters of two special
fonts are used to denote deterministic matrices (e.g., $\mathsf{Y}$)
and random matrices (e.g., $\mathbb{Y}$). The element-wise complex
conjugate of the vector $\mathbf{x}$ is denoted by $\bar{\mathbf{x}}$. The superscripts $^T$
and $^H$ stand for transposition and Hermitian transposition,
respectively. The standard (Hermitian) inner product of two
vectors $\mathbf{x} = [x_1 \cdots x_n]^T$ and $\mathbf{y} = [y_1 \cdots y_n]^T$ is

$$\langle \mathbf{x}, \mathbf{y} \rangle \triangleq \sum_{i=1}^{n} x_i \bar{y}_i. \tag{3}$$

The Euclidean norm is denoted by $\|\mathbf{x}\|^2 \triangleq \langle \mathbf{x}, \mathbf{x} \rangle$.
Furthermore, $\mathcal{CN}(\mathbf{0}, \mathsf{A})$ stands for the distribution of a circularly-
symmetric complex Gaussian random vector with covariance
matrix $\mathsf{A}$. Given two distributions $P$ and $Q$ on a common
measurable space $\mathcal{W}$, we define a randomized test between
$P$ and $Q$ as a random transformation $P_{Z|W}: \mathcal{W} \to \{0, 1\}$
where $0$ indicates that the test chooses $Q$. We shall need the
following performance metric for the test between $P$ and $Q$:

$$\beta_\alpha(P, Q) \triangleq \min \int P_{Z|W}(1\,|\,w)\, Q(dw) \tag{4}$$

where the minimum is over all probability distributions $P_{Z|W}$
satisfying

$$\int P_{Z|W}(1\,|\,w)\, P(dw) \geq \alpha. \tag{5}$$

We refer to a test achieving (4) as an optimal test. The indicator
function is denoted by $\mathbb{1}\{\cdot\}$. Finally, $\log(\cdot)$ indicates the
natural logarithm, and $\mathrm{Beta}(\cdot, \cdot)$ denotes the Beta distribution [9,
Ch. 25].

II. Channel Model and Fundamental Limits 

We consider a quasi-static SIMO channel with $r$ receive
antennas. The channel input-output relation is given by

$$\mathbb{Y} = \mathbf{x}\mathbf{H}^T + \mathbb{W} \tag{6}$$

$$= \begin{pmatrix} x_1 H_1 + W_{11} & \cdots & x_1 H_r + W_{1r} \\ \vdots & & \vdots \\ x_n H_1 + W_{n1} & \cdots & x_n H_r + W_{nr} \end{pmatrix}. \tag{7}$$

The vector $\mathbf{H} = [H_1 \cdots H_r]^T$ contains the complex fading
coefficients, which are random but remain constant for all $n$
channel uses; $\{W_{lm}\}$ are independent and identically
distributed (i.i.d.) $\mathcal{CN}(0, 1)$ random variables; $\mathbf{x} = [x_1 \cdots x_n]^T$
contains the transmitted symbols.

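We consider both the case when the transmitter and the
receiver do not know the realizations of $\mathbf{H}$ (no CSI) and the
case where the realizations of $\mathbf{H}$ are available to both the
transmitter and the receiver (CSIRT). Next, we introduce the
notion of a channel code for these two settings.

Definition 1: An $(n, M, \epsilon)_{\text{no-CSI}}$ code consists of:

i) an encoder $f: \{1, \ldots, M\} \to \mathbb{C}^n$ that maps the message
$J \in \{1, \ldots, M\}$ to a codeword $\mathbf{x} \in \{\mathbf{c}_1, \ldots, \mathbf{c}_M\}$. The
codewords satisfy the power constraint

$$\|\mathbf{c}_i\|^2 \leq n\rho, \quad i = 1, \ldots, M. \tag{8}$$

We assume that $J$ is equiprobable on $\{1, \ldots, M\}$.

ii) A decoder $g: \mathbb{C}^{n \times r} \to \{1, \ldots, M\}$ satisfying

$$P[g(\mathbb{Y}) \neq J] \leq \epsilon \tag{9}$$

where $\mathbb{Y}$ is the channel output induced by the transmitted
codeword according to (6).

The maximal achievable rate for the no-CSI case is defined as

$$R^*_{\mathrm{no}}(n, \epsilon) \triangleq \sup\left\{ \frac{\log M}{n} : \exists\, (n, M, \epsilon)_{\text{no-CSI}} \text{ code} \right\}. \tag{10}$$

Definition 2: An $(n, M, \epsilon)_{\text{CSIRT}}$ code consists of:

i) an encoder $f: \{1, \ldots, M\} \times \mathbb{C}^r \to \mathbb{C}^n$ that maps the
message $J \in \{1, \ldots, M\}$ and the channel $\mathbf{H}$ to a
codeword $\mathbf{x} \in \{\mathbf{c}_1(\mathbf{H}), \ldots, \mathbf{c}_M(\mathbf{H})\}$; the codewords
satisfy

$$\|\mathbf{c}_i(\mathbf{h})\|^2 \leq n\rho, \quad \forall i = 1, \ldots, M, \quad \forall \mathbf{h} \in \mathbb{C}^r. \tag{11}$$

We assume that $J$ is equiprobable on $\{1, \ldots, M\}$.

ii) A decoder $g: \mathbb{C}^{n \times r} \times \mathbb{C}^r \to \{1, \ldots, M\}$ satisfying

$$P[g(\mathbb{Y}, \mathbf{H}) \neq J] \leq \epsilon. \tag{12}$$

The maximal achievable rate for the CSIRT case is defined as

$$R^*_{\mathrm{rt}}(n, \epsilon) \triangleq \sup\left\{ \frac{\log M}{n} : \exists\, (n, M, \epsilon)_{\text{CSIRT}} \text{ code} \right\}. \tag{13}$$

It follows that

$$R^*_{\mathrm{no}}(n, \epsilon) \leq R^*_{\mathrm{rt}}(n, \epsilon). \tag{14}$$

Let $G \triangleq \|\mathbf{H}\|^2$, and define

$$F_C(\xi) \triangleq P[\log(1 + \rho G) \leq \xi]. \tag{15}$$

For every $\epsilon > 0$, the $\epsilon$-capacity $C_\epsilon$ of the channel (6) is [1,
Thm. 6]

$$C_\epsilon = \lim_{n \to \infty} R^*_{\mathrm{no}}(n, \epsilon) = \lim_{n \to \infty} R^*_{\mathrm{rt}}(n, \epsilon) = \sup\{\xi : F_C(\xi) \leq \epsilon\}. \tag{16}$$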
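To see how (15) and (16) translate into a number, here is a small Python sketch that computes the $\epsilon$-capacity by bisection. It assumes, purely for illustration, i.i.d. Rayleigh fading with unit-variance coefficients, so that $G$ follows an Erlang distribution with $r$ stages; neither this fading model nor the function names come from the paper.

```python
import math


def gain_cdf_rayleigh(g: float, r: int) -> float:
    """P[G <= g] for G = ||H||^2 with r i.i.d. CN(0, 1) entries (Erlang cdf)."""
    if g <= 0.0:
        return 0.0
    term, total = 1.0, 1.0
    for j in range(1, r):
        term *= g / j          # term = g^j / j!
        total += term
    return 1.0 - math.exp(-g) * total


def outage_capacity(rho: float, eps: float, r: int) -> float:
    """C_eps = sup{xi : F_C(xi) <= eps} in nats per channel use.

    Since F_C(xi) = P[G <= (exp(xi) - 1) / rho], it is enough to find the
    eps-quantile of G by bisection and map it through log(1 + rho * g).
    """
    lo, hi = 0.0, 1.0
    while gain_cdf_rayleigh(hi, r) < eps:   # bracket the quantile
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if gain_cdf_rayleigh(mid, r) <= eps:
            lo = mid
        else:
            hi = mid
    return math.log1p(rho * lo)


if __name__ == "__main__":
    for antennas in (1, 2, 4):
        print(antennas, outage_capacity(rho=10.0, eps=1e-2, r=antennas))
```

For $r = 1$ the quantile is available in closed form, $C_\epsilon = \log(1 - \rho\log(1-\epsilon))$, which gives a convenient sanity check of the bisection.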

III. Main Results 

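We present next a converse (upper) bound on $R^*_{\mathrm{rt}}(n, \epsilon)$ and
an achievability (lower) bound on $R^*_{\mathrm{no}}(n, \epsilon)$. The two bounds
match asymptotically up to a $O(\log(n)/n)$ term, allowing us
to establish (2).

A. Converse Bound

Theorem 1: Let

$$L_n \triangleq n \log(1 + \rho G) + \sum_{i=1}^{n} \left( 1 - \left| \sqrt{\rho G}\, Z_i - \sqrt{1 + \rho G} \right|^2 \right) \tag{17}$$

and

$$S_n \triangleq n \log(1 + \rho G) + \sum_{i=1}^{n} \left( 1 - \frac{\left| \sqrt{\rho G}\, Z_i - 1 \right|^2}{1 + \rho G} \right) \tag{18}$$

with $G = \|\mathbf{H}\|^2$ and $\{Z_i\}_{i=1}^{n}$ i.i.d. $\mathcal{CN}(0, 1)$-distributed. For
every $n$ and every $0 < \epsilon < 1$, the maximal achievable rate
on the quasi-static SIMO fading channel (6) with CSIRT is
upper-bounded by

$$R^*_{\mathrm{rt}}(n - 1, \epsilon) \leq \frac{1}{n - 1} \log \frac{1}{P[L_n \geq n\gamma_n]} \tag{19}$$

where $\gamma_n$ is the solution of

$$P[S_n \leq n\gamma_n] = \epsilon. \tag{20}$$

Proof: See Appendix A.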

B. Achievability Bound 

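Let $Z(\mathbb{Y}): \mathbb{C}^{n \times r} \to \{0, 1\}$ be a test between an arbitrary
distribution $Q_{\mathbb{Y}}$ and $P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}}$, where $Z = 0$ indicates that the
test chooses $Q_{\mathbb{Y}}$. Let $\mathcal{F} \subset \mathbb{C}^n$ be a set of permissible channel
inputs as specified by (8). We define the following measure
of performance $\tilde\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}})$ for the composite hypothesis test
between $Q_{\mathbb{Y}}$ and the collection $\{P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}}\}_{\mathbf{x} \in \mathcal{F}}$:

$$\tilde\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}}) \triangleq \inf Q_{\mathbb{Y}}[Z(\mathbb{Y}) = 1] \tag{21}$$

where the infimum is over all deterministic tests $Z(\cdot)$ satisfying:

i) $P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}}[Z(\mathbb{Y}) = 1] \geq \tau$, $\forall \mathbf{x} \in \mathcal{F}$,

ii) $Z(\mathsf{Y}) = Z(\tilde{\mathsf{Y}})$ whenever the columns of $\mathsf{Y}$ and $\tilde{\mathsf{Y}}$ span
the same subspace in $\mathbb{C}^n$.

Note that $\tilde\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}})$ in (21) coincides with $\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}})$
defined in [5, eq. (107)] if the additional constraint ii) is dropped
and if the infimum in (21) is taken over randomized tests.
Hence,

$$\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}}) \leq \tilde\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}}). \tag{22}$$

We will need the following definition.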

We will need the following definition. 

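Definition 3 ([10, Def. 4]): Let $\mathbf{a}$ be a nonzero vector and
let $\mathcal{B}$ be an $l$-dimensional ($l < n$) subspace in $\mathbb{C}^n$. We define
the angle $\theta(\mathbf{a}, \mathcal{B}) \in [0, \pi/2]$ between $\mathbf{a}$ and $\mathcal{B}$ by

$$\cos\theta(\mathbf{a}, \mathcal{B}) \triangleq \max_{\mathbf{b} \in \mathcal{B},\, \|\mathbf{b}\| = 1} \frac{|\langle \mathbf{a}, \mathbf{b} \rangle|}{\|\mathbf{a}\|}. \tag{23}$$

With a slight abuse of notation, for a matrix $\mathsf{B} \in \mathbb{C}^{n \times l}$ we
use $\theta(\mathbf{a}, \mathsf{B})$ to indicate the angle between $\mathbf{a}$ and the subspace $\mathcal{B}$
spanned by the columns of $\mathsf{B}$. In particular, if the columns of
$\mathsf{B}$ are an orthonormal basis for $\mathcal{B}$, then

$$\cos\theta(\mathbf{a}, \mathsf{B}) = \frac{\|\mathbf{a}^H \mathsf{B}\|}{\|\mathbf{a}\|}. \tag{24}$$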
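The angle in Definition 3 can be computed numerically by orthonormalizing the columns and applying (24). The following Python sketch does this with a plain Gram-Schmidt procedure on lists of complex numbers; the helper names are hypothetical, and the inner product places the conjugate on the second argument, which is an assumed convention (the value of the angle does not depend on it).

```python
def inner(x, y):
    """Hermitian inner product; the conjugate sits on the second argument."""
    return sum(xi * yi.conjugate() for xi, yi in zip(x, y))


def orthonormal_basis(columns, tol=1e-12):
    """Gram-Schmidt on a list of column vectors (lists of complex numbers)."""
    basis = []
    for col in columns:
        v = list(col)
        for b in basis:
            coeff = inner(v, b)                      # component of v along b
            v = [vi - coeff * bi for vi, bi in zip(v, b)]
        norm = abs(inner(v, v)) ** 0.5
        if norm > tol:                               # skip dependent columns
            basis.append([vi / norm for vi in v])
    return basis


def cos2_angle(a, columns):
    """cos^2 of the angle between a and the span of the given columns, cf. (24)."""
    basis = orthonormal_basis(columns)
    projected_energy = sum(abs(inner(a, b)) ** 2 for b in basis)
    return projected_energy / inner(a, a).real


if __name__ == "__main__":
    span = [[1, 0, 0], [1, 1, 0]]          # spans the first two coordinates
    print(cos2_angle([1, 1j, 0], span))    # vector inside the span
    print(cos2_angle([0, 0, 1], span))     # vector orthogonal to the span
```

A vector lying in the span gives $\cos^2\theta = 1$ and a vector orthogonal to it gives $0$, which is the geometric picture exploited by the test in Theorem 2.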



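Theorem 2 below establishes a lower bound on $R^*_{\mathrm{no}}(n, \epsilon)$.

Theorem 2: Let $\mathcal{F} \subset \mathbb{C}^n$ be a measurable set of channel
inputs satisfying (8). For every $0 < \epsilon < 1$, every
$0 < \tau < \epsilon$, and every probability distribution $Q_{\mathbb{Y}}$, there exists
an $(n, M, \epsilon)_{\text{no-CSI}}$ code satisfying

$$M \geq \frac{\tilde\kappa_\tau(\mathcal{F}, Q_{\mathbb{Y}})}{\sup_{\mathbf{x} \in \mathcal{F}} Q_{\mathbb{Y}}[Z_{\mathbf{x}}(\mathbb{Y}) = 1]} \tag{25}$$

where

$$Z_{\mathbf{x}}(\mathbb{Y}) = \mathbb{1}\{\cos^2\theta(\mathbf{x}, \mathbb{Y}) \geq 1 - \gamma_n(\mathbf{x})\} \tag{26}$$

with $\gamma_n(\mathbf{x}) \in [0, 1]$ chosen so that

$$P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}}[Z_{\mathbf{x}}(\mathbb{Y}) = 1] \geq 1 - \epsilon + \tau. \tag{27}$$

Proof: The lower bound (25) follows by applying the $\kappa\beta$
bound [5, Thm. 25] to a stochastically degraded version of (6),
whose output is the subspace spanned by the columns of $\mathbb{Y}$.

The geometric intuition behind the choice of the test (26) is
that $\mathbf{x}$ in (6) belongs to the subspace spanned by the columns
of $\mathbb{Y}$ if the additive noise $\mathbb{W}$ is neglected.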

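In Corollary 3 below, we present a further lower bound
on $M$ that is obtained from Theorem 2 by choosing

$$Q_{\mathbb{Y}} = \prod_{i=1}^{n} \mathcal{CN}(\mathbf{0}, \mathsf{I}_r) \tag{28}$$

and by requiring that the codewords belong to the set

$$\mathcal{F}_n \triangleq \{\mathbf{x} \in \mathbb{C}^n : \|\mathbf{x}\|^2 = n\rho\}. \tag{29}$$

The resulting bound allows for numerical evaluation.

Corollary 3: For every $0 < \epsilon < 1$ and every $0 < \tau < \epsilon$ there
exists an $(n, M, \epsilon)_{\text{no-CSI}}$ code with codewords in the set $\mathcal{F}_n$
satisfying

$$M \geq \frac{\tau}{F(\gamma_n; n - r, r)} \tag{30}$$

where $F(\cdot\,; n - r, r)$ is the cumulative distribution function
(cdf) of a $\mathrm{Beta}(n - r, r)$-distributed random variable and
$\gamma_n \in [0, 1]$ is chosen so that

$$P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0}[Z_{\mathbf{x}_0}(\mathbb{Y}) = 1] \geq 1 - \epsilon + \tau \tag{31}$$

with

$$\mathbf{x}_0 \triangleq [\sqrt{\rho}\ \sqrt{\rho}\ \cdots\ \sqrt{\rho}]^T. \tag{32}$$

Proof: See Appendix B.

Footnote: Corollary 3 actually holds for every isotropic distribution on $\mathbb{Y}$.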
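The right-hand side of (30) is easy to evaluate once $\gamma_n$ is known, because for integer parameters the Beta cdf reduces to a finite binomial sum. The Python sketch below works in the log domain to avoid underflow at large $n$. It is only a sketch under stated assumptions: $\gamma_n$ is taken as an input (determining it from (31) requires a separate computation that is not shown), and the function names are illustrative.

```python
import math


def log_beta_cdf_int(x: float, a: int, b: int) -> float:
    """log of the Beta(a, b) cdf at x in (0, 1) for positive integers a, b.

    Uses the identity I_x(a, b) = sum_{j=a}^{a+b-1} C(a+b-1, j) x^j (1-x)^(a+b-1-j)
    and a log-sum-exp evaluation so that large a does not underflow.
    """
    m = a + b - 1
    log_terms = [
        math.lgamma(m + 1) - math.lgamma(j + 1) - math.lgamma(m - j + 1)
        + j * math.log(x) + (m - j) * math.log1p(-x)
        for j in range(a, m + 1)
    ]
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


def achievable_rate(n: int, r: int, tau: float, gamma: float) -> float:
    """(1/n) * log(tau / F(gamma; n - r, r)), in nats per channel use."""
    return (math.log(tau) - log_beta_cdf_int(gamma, n - r, r)) / n


if __name__ == "__main__":
    # gamma is a placeholder value here, not the solution of (31).
    print(achievable_rate(n=500, r=2, tau=1.0 / 500, gamma=0.5))
```

For $r = 1$ the cdf is simply $\gamma_n^{\,n-1}$, so the bound reads $\log M \geq \log\tau - (n-1)\log\gamma_n$; this special case makes the exponential dependence on $-\log\gamma_n$ explicit.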



C. Asymptotic Analysis 

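Following [5, Def. 2], we define the $\epsilon$-dispersion of the
channel (6) via $R^*_{\mathrm{no}}(n, \epsilon)$ (resp. $R^*_{\mathrm{rt}}(n, \epsilon)$) as follows

$$V_\epsilon^{\mathrm{no}} \triangleq \limsup_{n \to \infty}\, n \left( \frac{C_\epsilon - R^*_{\mathrm{no}}(n, \epsilon)}{Q^{-1}(\epsilon)} \right)^2, \quad \epsilon \in (0, 1) \setminus \{1/2\} \tag{33}$$

$$V_\epsilon^{\mathrm{rt}} \triangleq \limsup_{n \to \infty}\, n \left( \frac{C_\epsilon - R^*_{\mathrm{rt}}(n, \epsilon)}{Q^{-1}(\epsilon)} \right)^2, \quad \epsilon \in (0, 1) \setminus \{1/2\}. \tag{34}$$

The rationale behind the definition of the channel dispersion
is that, for ergodic channels, the probability of error $\epsilon$ and
the optimal rate $R^*(n, \epsilon)$ roughly satisfy

$$\epsilon \approx P\left[ C + \sqrt{\frac{V}{n}}\, Z \leq R^*(n, \epsilon) \right] \tag{35}$$

where $C$ and $V$ are the channel capacity and dispersion,
respectively, and $Z$ is a zero-mean unit-variance real Gaussian
random variable. The quasi-static fading channel is
conditionally ergodic given $\mathbf{H}$, which suggests that

$$\epsilon \approx P\left[ C(\mathbf{H}) + \sqrt{\frac{V(\mathbf{H})}{n}}\, Z \leq R^*(n, \epsilon) \right] \tag{36}$$

where $C(\mathbf{H})$ and $V(\mathbf{H})$ are the capacity and the dispersion of
the conditional channels. Assuming that $Z$ is independent of
$\mathbf{H}$, we expect that (36) be well-approximated by

$$\epsilon \approx P[C(\mathbf{H}) \leq R^*(n, \epsilon)]. \tag{37}$$

Indeed, given $\mathbf{H} = \mathbf{h}$, the probability $P[Z \leq (R^*(n, \epsilon) -
C(\mathbf{h}))/\sqrt{V(\mathbf{h})/n}]$ is close to one in the "outage" case
$C(\mathbf{h}) < R^*(n, \epsilon)$, and close to zero otherwise. This
observation is formalized in the following lemma.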

Lemma 4: Let $X$ be a random variable with zero mean,
unit variance, and finite third moment. Let $Y$ be independent
of $X$ with twice continuously differentiable probability density
function (pdf) $f_Y$. Then

$$\lim_{n \to \infty} n^{3/2} \left| P[X \leq \sqrt{n}\, Y] - P[Y \geq 0] + \frac{f_Y'(0)}{2n} \right| \leq k_1 \tag{38}$$

where $k_1 \triangleq (k_2(\delta)/6 + 2\delta^{-3})\, E[|X|^3]$, $k_2(\delta) \triangleq
\sup_{t \in (-\delta, \delta)} \max\{|f_Y(t)|, |f_Y'(t)|, |f_Y''(t)|\}$ and $\delta > 0$ is
chosen so that $k_2$ is finite.

Proof: See Appendix C.
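A quick numerical sanity check of the lemma is possible in a case where every term has a closed form. The Python sketch below takes $X \sim \mathcal{N}(0,1)$ and $Y \sim \mathcal{N}(m,1)$, independent; this Gaussian choice and the function name are illustrative assumptions, not part of the paper. In that case $P[X \le \sqrt{n}\,Y] = \Phi(m\sqrt{n/(n+1)})$, $P[Y \ge 0] = \Phi(m)$ and $f_Y'(0) = m\,\varphi(m)$, where $\Phi$ and $\varphi$ are the standard normal cdf and pdf.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def lemma4_residual(n: int, mean: float) -> float:
    """n^{3/2} * |P[X <= sqrt(n) Y] - P[Y >= 0] + f_Y'(0) / (2 n)|.

    Illustrative setting (an assumption): X ~ N(0, 1), Y ~ N(mean, 1),
    independent, so that X - sqrt(n) Y ~ N(-sqrt(n) * mean, 1 + n).
    """
    p_exact = STD_NORMAL.cdf(math.sqrt(n) * mean / math.sqrt(1.0 + n))
    p_limit = STD_NORMAL.cdf(mean)              # P[Y >= 0]
    pdf_slope = mean * STD_NORMAL.pdf(mean)     # f_Y'(0) for Y ~ N(mean, 1)
    return n ** 1.5 * abs(p_exact - p_limit + pdf_slope / (2.0 * n))


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, lemma4_residual(n, mean=0.5))
```

In this Gaussian example the scaled residual stays bounded and in fact shrinks with $n$, which is consistent with the $1/(2n)$ correction term capturing the leading deviation from $P[Y \ge 0]$.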
From (36) and (37), and recalling (16), we may expect that
for a quasi-static fading channel the maximal achievable rate
satisfies

$$R^*(n, \epsilon) = C_\epsilon + 0 \cdot \frac{1}{\sqrt{n}} + \text{smaller-order terms}. \tag{39}$$

This intuitive reasoning turns out to be correct as the
following result demonstrates.

Theorem 5: Assume that the channel gain $G = \|\mathbf{H}\|^2$ has
a twice continuously differentiable pdf and that $C_\epsilon$ is a point
of growth of the capacity-outage function (15), i.e.,

$$F_C'(C_\epsilon) > 0. \tag{40}$$

Then, the maximal achievable rates satisfy

$$R^*_{\mathrm{no}}(n, \epsilon) = C_\epsilon + O\!\left(\frac{\log n}{n}\right) \tag{41}$$

$$R^*_{\mathrm{rt}}(n, \epsilon) = C_\epsilon + O\!\left(\frac{\log n}{n}\right). \tag{42}$$

Hence, the $\epsilon$-dispersion is zero for both the no-CSI and the
CSIRT case:

$$V_\epsilon^{\mathrm{no}} = V_\epsilon^{\mathrm{rt}} = 0, \quad \epsilon \in (0, 1) \setminus \{1/2\}. \tag{43}$$

Proof: See Appendix D.

The assumptions on the channel gain are satisfied by all
probability distributions commonly used to model fading,
e.g., Rayleigh, Rician, and Nakagami. However, the standard
AWGN channel, which can be seen as a quasi-static fading
channel with $G = 1$, does not meet these assumptions and in
fact has positive dispersion [5, Thm. 54].

D. Numerical Results 

Fig. 1 shows the achievability bound (30) and the converse
bound (19) for a quasi-static SIMO fading channel with two
receive antennas. The channel between the transmit antenna
and each of the two receive antennas is Rician-distributed with
$K$-factor equal to 20 dB. The two channels are assumed to be
independent. We set $\epsilon = 10^{-3}$ and choose $\rho = -1.55$ dB so
that $C_\epsilon = 1$ bit/channel use. For reference, we also plotted a
lower bound on $R^*_{\mathrm{rt}}(n, \epsilon)$ obtained by using the $\kappa\beta$ bound [5,
Thm. 25] and assuming CSIR. Fig. 1 also shows the
approximation (1) for $R^*(n, \epsilon)$ corresponding to an AWGN channel
with $C = 1$ bit/channel use. Note that we replaced the term
$O(\log(n)/n)$ in (1) with $\log(n)/(2n)$ (see [5, eq. (296)]).
The blocklength required to achieve 90% of the $\epsilon$-capacity of
the quasi-static fading channel is in the range [120, 320] for the
CSIRT case and in the range [120, 480] for the no-CSI case.
For the AWGN channel, this number is approximately 1420.
Hence, for the parameters chosen in Fig. 1, the prediction
(based on zero dispersion) of fast convergence to capacity is
validated.
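The operating point of this example can be cross-checked numerically. The Python sketch below computes the $\epsilon$-capacity of an independent Rician SIMO channel from (15) and (16). It rests on assumptions that the text does not spell out: each coefficient is normalized to unit average power, $H_i = \sqrt{K/(K+1)} + \sqrt{1/(K+1)}\,Z_i$ with $Z_i \sim \mathcal{CN}(0,1)$, so that $(K+1)G$ is a Poisson mixture of Erlang variables. The function names and the series truncation rule are likewise illustrative.

```python
import math


def poisson_pmf(j: int, mu: float) -> float:
    """Poisson(mu) probability of j, evaluated in the log domain (mu > 0)."""
    return math.exp(j * math.log(mu) - mu - math.lgamma(j + 1))


def rician_gain_cdf(g: float, r: int, k_factor: float) -> float:
    """P[G <= g] for G = ||H||^2 with r independent unit-power Rician entries.

    (K + 1) G = sum_i |sqrt(K) + Z_i|^2 is, given N ~ Poisson(r K),
    Erlang with r + N stages, and P[Erlang(m) <= y] = P[Poisson(y) >= m].
    """
    if g <= 0.0:
        return 0.0
    lam = r * k_factor
    y = (k_factor + 1.0) * g
    k_max = int(lam + 12.0 * math.sqrt(lam) + 30.0)      # truncation (heuristic)
    below = sum(poisson_pmf(j, y) for j in range(r))     # P[Poisson(y) < r]
    cdf = 0.0
    for k in range(k_max + 1):
        cdf += poisson_pmf(k, lam) * (1.0 - below)
        below += poisson_pmf(r + k, y)                   # now P[Poisson(y) < r+k+1]
    return cdf


def rician_outage_capacity_bits(rho_db: float, eps: float, r: int, k_db: float) -> float:
    """C_eps in bits per channel use, via bisection on the eps-quantile of G."""
    rho = 10.0 ** (rho_db / 10.0)
    k_factor = 10.0 ** (k_db / 10.0)
    lo, hi = 0.0, 1.0
    while rician_gain_cdf(hi, r, k_factor) < eps:
        hi *= 2.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if rician_gain_cdf(mid, r, k_factor) <= eps:
            lo = mid
        else:
            hi = mid
    return math.log2(1.0 + rho * lo)


if __name__ == "__main__":
    print(rician_outage_capacity_bits(rho_db=-1.55, eps=1e-3, r=2, k_db=20.0))
```

Under the stated normalization, the computed value should come out close to 1 bit per channel use for the parameters above, in line with the setup of Fig. 1. A different power normalization of the Rician coefficients would shift the required SNR accordingly.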
Appendix A
Proof of Theorem 1

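For the channel (6) with CSIRT, the input is the pair
$(\mathbf{X}, \mathbf{H})$, and the output is the pair $(\mathbb{Y}, \mathbf{H})$. Note that the
encoder induces a distribution $P_{\mathbf{X}|\mathbf{H}}$ on $\mathbf{X}$ and is necessarily
randomized, since $\mathbf{H}$ is independent of the message $J$. Denote
by $R^*_{\mathrm{e}}(n, \epsilon)$ the maximal achievable rate under the constraint
that each codeword $\mathbf{c}_j(\mathbf{h})$ satisfies the power constraint (11)
with equality, namely, $\mathbf{c}_j(\mathbf{h})$ belongs to the set $\mathcal{F}_n$ defined
in (29) for $j = 1, \ldots, M$ and for all $\mathbf{h} \in \mathbb{C}^r$. Then by [5,
Lem. 39],

$$R^*_{\mathrm{rt}}(n - 1, \epsilon) \leq \frac{n}{n - 1} R^*_{\mathrm{e}}(n, \epsilon). \tag{44}$$

We next establish an upper bound on $R^*_{\mathrm{e}}(n, \epsilon)$. Henceforth, $\mathbf{x}$
is assumed to belong to $\mathcal{F}_n$. To upper-bound $R^*_{\mathrm{e}}(n, \epsilon)$, we use
the meta-converse theorem [5, Thm. 26]. As auxiliary channel
$Q_{\mathbb{Y}\mathbf{H}|\mathbf{X}\mathbf{H}}$, we take a channel that passes $\mathbf{H}$ unchanged and
generates $\mathbb{Y}$ according to the following distribution

$$Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}} = \prod_{i=1}^{n} \mathcal{CN}(\mathbf{0}, \mathsf{I}_r + \rho\, \mathbf{h}\mathbf{h}^H). \tag{45}$$

In particular, $\mathbb{Y}$ and $\mathbf{X}$ are conditionally independent given $\mathbf{H}$.
Since $\mathbf{H}$ and the message $J$ are independent, $\mathbb{Y}$ and $J$ are
independent under the auxiliary $Q$-channel. Hence, the average
error probability $\epsilon'$ under the auxiliary $Q$-channel is bounded
as

$$\epsilon' \geq 1 - \frac{1}{M}. \tag{46}$$

Then, [5, Thm. 26]

$$n R^*_{\mathrm{e}}(n, \epsilon) \leq \sup_{P_{\mathbf{X}|\mathbf{H}}} \log \frac{1}{\beta_{1-\epsilon}(P_{\mathbf{X}\mathbb{Y}\mathbf{H}},\, P_{\mathbf{H}} P_{\mathbf{X}|\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}})} \tag{47}$$

where $\beta_{1-\epsilon}(\cdot, \cdot)$ is defined in (4), and the supremum is over
all conditional distributions $P_{\mathbf{X}|\mathbf{H}}$ supported on $\mathcal{F}_n$. We next
note that, by the spherical symmetry of $\mathcal{F}_n$ and of (45),
the function $\beta_\alpha(P_{\mathbb{Y}|\mathbf{X}=\mathbf{x},\mathbf{H}=\mathbf{h}}, Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}})$ does not depend on
$\mathbf{x} \in \mathcal{F}_n$. By [5, Lem. 29], this implies

$$\beta_\alpha(P_{\mathbf{X}\mathbb{Y}|\mathbf{H}=\mathbf{h}},\, P_{\mathbf{X}|\mathbf{H}=\mathbf{h}} Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}) = \beta_\alpha(P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}},\, Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}) \tag{48}$$

(with $\mathbf{x}_0$ defined in (32)) for every $P_{\mathbf{X}|\mathbf{H}=\mathbf{h}}$ supported on $\mathcal{F}_n$,
every $\mathbf{h} \in \mathbb{C}^r$, and every $\alpha$. Consider the optimal test $Z$ for
$P_{\mathbf{X}\mathbb{Y}\mathbf{H}}$ versus $P_{\mathbf{H}} P_{\mathbf{X}|\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}$ under the constraint that

$$P_{\mathbf{X}\mathbb{Y}\mathbf{H}}[Z = 1] = \int P_{\mathbf{X}\mathbb{Y}|\mathbf{H}=\mathbf{h}}[Z = 1]\, dP_{\mathbf{H}}(\mathbf{h}) \geq 1 - \epsilon. \tag{49}$$

With $\alpha(\mathbf{h}) \triangleq P_{\mathbf{X}\mathbb{Y}|\mathbf{H}=\mathbf{h}}[Z = 1]$, we have that

$$\beta_{1-\epsilon}(P_{\mathbf{X}\mathbb{Y}\mathbf{H}},\, P_{\mathbf{H}} P_{\mathbf{X}|\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}) = \int P_{\mathbf{X}|\mathbf{H}=\mathbf{h}} Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}[Z = 1]\, dP_{\mathbf{H}}(\mathbf{h}) \tag{50}$$

$$\geq \int \beta_{\alpha(\mathbf{h})}(P_{\mathbf{X}\mathbb{Y}|\mathbf{H}=\mathbf{h}},\, P_{\mathbf{X}|\mathbf{H}=\mathbf{h}} Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}})\, dP_{\mathbf{H}}(\mathbf{h}) \tag{51}$$

$$= \int \beta_{\alpha(\mathbf{h})}(P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}},\, Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}})\, dP_{\mathbf{H}}(\mathbf{h}) \tag{52}$$

where (52) follows from (48). Fix an arbitrary $\mathbf{h} \in \mathbb{C}^r$, and let
$Z^*_{\mathbf{h}}$ be an optimal test between $P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}}$ and $Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}$,
i.e., a test satisfying

$$P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}}[Z^*_{\mathbf{h}} = 1] \geq \alpha(\mathbf{h}) \tag{53}$$

and

$$Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}[Z^*_{\mathbf{h}} = 1] = \beta_{\alpha(\mathbf{h})}(P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}},\, Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}). \tag{54}$$

Then $Z^*_{\mathbf{H}}$ is a test between $P_{\mathbf{H}} P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}}$ and $P_{\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}$.
Moreover,

$$\int P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}}[Z^*_{\mathbf{h}} = 1]\, dP_{\mathbf{H}}(\mathbf{h}) \geq \int \alpha(\mathbf{h})\, dP_{\mathbf{H}}(\mathbf{h}) \tag{55}$$

$$\geq 1 - \epsilon \tag{56}$$

where (56) follows from (49). Consequently,

$$\int \beta_{\alpha(\mathbf{h})}(P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}=\mathbf{h}},\, Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}})\, dP_{\mathbf{H}}(\mathbf{h}) = \int Q_{\mathbb{Y}|\mathbf{H}=\mathbf{h}}[Z^*_{\mathbf{h}} = 1]\, dP_{\mathbf{H}}(\mathbf{h}) \tag{57}$$

$$\geq \beta_{1-\epsilon}(P_{\mathbf{H}} P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}},\, P_{\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}) \tag{58}$$

where (57) follows from (54) and (58) follows by the definition
of $\beta_{1-\epsilon}(\cdot, \cdot)$ and by (56). Substituting (58) into (52), we
obtain that

$$\beta_{1-\epsilon}(P_{\mathbf{X}\mathbb{Y}\mathbf{H}},\, P_{\mathbf{H}} P_{\mathbf{X}|\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}) \geq \beta_{1-\epsilon}(P_{\mathbf{H}} P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}},\, P_{\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}) \tag{59}$$

for every $P_{\mathbf{X}|\mathbf{H}}$ supported on $\mathcal{F}_n$. It can be shown that (59)
holds, in fact, with equality.

In the following, to shorten notation, we define

$$P_0 \triangleq P_{\mathbf{H}} P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0,\mathbf{H}}, \qquad Q_0 \triangleq P_{\mathbf{H}} Q_{\mathbb{Y}|\mathbf{H}}. \tag{60}$$

Using this notation, (47) becomes

$$n R^*_{\mathrm{e}}(n, \epsilon) \leq -\log \beta_{1-\epsilon}(P_0, Q_0). \tag{61}$$

Let

$$r(\mathbf{x}_0; \mathbb{Y}\mathbf{H}) \triangleq \log \frac{dP_0}{dQ_0}. \tag{62}$$

By the Neyman-Pearson lemma (see for example [12, p. 23]),

$$\beta_{1-\epsilon}(P_0, Q_0) = Q_0[r(\mathbf{x}_0; \mathbb{Y}\mathbf{H}) \geq n\gamma_n] \tag{63}$$

where $\gamma_n$ is the solution of

$$P_0[r(\mathbf{x}_0; \mathbb{Y}\mathbf{H}) \leq n\gamma_n] = \epsilon. \tag{64}$$

We conclude the proof by noting that, under $Q_0$, the random
variable $r(\mathbf{x}_0; \mathbb{Y}\mathbf{H})$ has the same distribution as $L_n$ in (17),
and under $P_0$, it has the same distribution as $S_n$ in (18).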

Appendix B 
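Proof of Corollary 3

Due to spherical symmetry and to the assumption that
$\mathbf{x} \in \mathcal{F}_n$, the term $P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}}[\cos^2\theta(\mathbf{x}, \mathbb{Y}) \geq 1 - \gamma_n]$ on the LHS
of (27) does not depend on $\mathbf{x}$. Hence, we can set $\mathbf{x} = \mathbf{x}_0$.

We next evaluate $\sup_{\mathbf{x} \in \mathcal{F}_n} Q_{\mathbb{Y}}[Z_{\mathbf{x}}(\mathbb{Y}) = 1]$ for the Gaussian
distribution $Q_{\mathbb{Y}}$ in (28). Under $Q_{\mathbb{Y}}$, the random subspace
spanned by the columns of $\mathbb{Y}$ is $r$-dimensional with probability
one, and is uniformly distributed on the Grassmann manifold
of $r$-planes in $\mathbb{C}^n$ [13, Sec. 6]. If we take $\mathbf{A} \sim Q_{\mathbf{A}} =
\mathcal{CN}(\mathbf{0}, \mathsf{I}_n)$ to be independent of $\mathbb{Y} \sim Q_{\mathbb{Y}}$, then for every
$\mathbf{x} \in \mathcal{F}_n$ and every $\mathsf{Y} \in \mathbb{C}^{n \times r}$ with full column rank

$$Q_{\mathbb{Y}}[Z_{\mathbf{x}}(\mathbb{Y}) = 1] = Q_{\mathbb{Y},\mathbf{A}}[Z_{\mathbf{A}}(\mathbb{Y}) = 1] \tag{65}$$

$$= Q_{\mathbf{A}}[Z_{\mathbf{A}}(\mathsf{Y}) = 1]. \tag{66}$$

In (65) we used that $Q_{\mathbb{Y}}[Z_{\mathbf{x}}(\mathbb{Y}) = 1]$ does not depend on $\mathbf{x}$;
(66) holds because $Q_{\mathbf{A}}$ is isotropic.
To compute the RHS of (66), we will choose for simplicity

$$\mathsf{Y} = \begin{pmatrix} \mathsf{I}_r \\ \mathsf{0}_{(n-r) \times r} \end{pmatrix}. \tag{67}$$

The columns of $\mathsf{Y}$ are orthonormal. Hence, by (24) and (26)

$$Q_{\mathbf{A}}[Z_{\mathbf{A}}(\mathsf{Y}) = 1] = Q_{\mathbf{A}}\left[ \frac{\|\mathbf{A}^H \mathsf{Y}\|^2}{\|\mathbf{A}\|^2} \geq 1 - \gamma_n \right] \tag{68}$$

$$= Q_{\mathbf{A}}\left[ \frac{\sum_{i=r+1}^{n} |A_i|^2}{\sum_{i=1}^{n} |A_i|^2} \leq \gamma_n \right] \tag{69}$$

where $A_i \sim \mathcal{CN}(0, 1)$ is the $i$th entry of $\mathbf{A}$. Observe that the
ratio

$$\frac{\sum_{i=r+1}^{n} |A_i|^2}{\sum_{i=1}^{n} |A_i|^2} \tag{70}$$

is $\mathrm{Beta}(n - r, r)$-distributed [9, Ch. 25.2].

To conclude the proof, we need to compute $\tilde\kappa_\tau(\mathcal{F}_n, Q_{\mathbb{Y}})$.
If we replace the constraint i) in (21) by the less stringent
constraint that

$$P_{\mathbb{Y}}[Z(\mathbb{Y}) = 1] = E_{P_{\mathbf{X}}^{(\mathrm{unif})}}\left[ P_{\mathbb{Y}|\mathbf{X}}[Z(\mathbb{Y}) = 1] \right] \geq \tau \tag{71}$$

with $P_{\mathbb{Y}}$ being the output distribution induced by the uniform
input distribution $P_{\mathbf{X}}^{(\mathrm{unif})}$ on $\mathcal{F}_n$, we get an infimum in (21),
which we denote by $\bar\kappa_\tau$, that is no larger than $\tilde\kappa_\tau(\mathcal{F}_n, Q_{\mathbb{Y}})$.
Because both $Q_{\mathbb{Y}}$ and the output distribution $P_{\mathbb{Y}}$ induced by
$P_{\mathbf{X}}^{(\mathrm{unif})}$ are isotropic, we conclude that

$$P_{\mathbb{Y}}[Z(\mathbb{Y}) = 1] = Q_{\mathbb{Y}}[Z(\mathbb{Y}) = 1] \geq \tau \tag{72}$$

for all tests $Z(\mathbb{Y})$ that satisfy (71) and the constraint ii) in
(21). Therefore,

$$\tilde\kappa_\tau(\mathcal{F}_n, Q_{\mathbb{Y}}) \geq \bar\kappa_\tau = \tau. \tag{73}$$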

Appendix C 
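Proof of Lemma 4

By assumption, there exist $\delta > 0$ and $k_2 < \infty$ such that

$$\max\{|f_Y(t)|, |f_Y'(t)|, |f_Y''(t)|\} \leq k_2 \tag{74}$$

for all $t \in (-\delta, \delta)$. Let $F_Y$ be the cdf of $Y$. We write

$$P[X \leq \sqrt{n}\, Y] = \int_{|x| \geq \delta\sqrt{n}} P[Y \geq x/\sqrt{n}]\, dP_X + \int_{|x| < \delta\sqrt{n}} P[Y \geq x/\sqrt{n}]\, dP_X \tag{75}$$

$$= \int_{|x| \geq \delta\sqrt{n}} P[Y \geq x/\sqrt{n}]\, dP_X + \int_{|x| < \delta\sqrt{n}} \left( 1 - F_Y(x/\sqrt{n}) \right) dP_X. \tag{76}$$

We next evaluate the two terms on the RHS of (76). For the
first term, we have that

$$\int_{|x| \geq \delta\sqrt{n}} P[Y \geq x/\sqrt{n}]\, dP_X \leq \int_{|x| \geq \delta\sqrt{n}} dP_X \tag{77}$$

$$\leq \frac{E[|X|^3]}{\delta^3 n^{3/2}} \tag{78}$$

where (78) follows from Markov's inequality. To compute the
second term on the RHS of (76), we note that, by Taylor's
theorem [14, Thm. 5.15], for all $x \in (-\delta\sqrt{n}, \delta\sqrt{n})$,

$$F_Y(x/\sqrt{n}) = F_Y(0) + f_Y(0) \frac{x}{\sqrt{n}} + \frac{f_Y'(0)}{2} \frac{x^2}{n} + \frac{f_Y''(x_0)}{6} \frac{x^3}{n^{3/2}} \tag{79}$$

for some $x_0 \in (0, x/\sqrt{n})$. Averaging over $X$, we get

$$\int_{|x| < \delta\sqrt{n}} F_Y(x/\sqrt{n})\, dP_X = F_Y(0)\left( 1 - P[|X| \geq \delta\sqrt{n}] \right) + \frac{f_Y(0)}{\sqrt{n}}\, c_1(n) + \frac{f_Y'(0)}{2n} \left( E[X^2] - c_2(n) \right) + \frac{1}{6 n^{3/2}}\, c_3(n) \tag{80}$$

where $c_1(n) \triangleq E[X \cdot \mathbb{1}\{|X| < \delta\sqrt{n}\}]$,
$c_2(n) \triangleq E[X^2 \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}]$, and
$c_3(n) \triangleq E[f_Y''(x_0)\, X^3 \cdot \mathbb{1}\{|X| < \delta\sqrt{n}\}]$.

The term $c_1(n)$ can be bounded as

$$|c_1(n)| = \left| E[X] - E[X \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}] \right| \tag{81}$$

$$= \left| E[X \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}] \right| \tag{82}$$

$$\leq E[|X| \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}] \tag{83}$$

$$= \frac{1}{\delta^2 n} E[\delta^2 n\, |X| \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}] \tag{84}$$

$$\leq \frac{1}{\delta^2 n} E[|X|^3 \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}] \tag{85}$$

where (82) holds because $E[X] = 0$. The term $c_2(n)$ can be bounded as

$$|c_2(n)| = \frac{1}{\delta\sqrt{n}} E[\delta\sqrt{n} \cdot |X|^2 \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}] \tag{86}$$

$$\leq \frac{1}{\delta\sqrt{n}} E[|X|^3 \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}]. \tag{87}$$

Finally, $c_3(n)$ can be bounded as

$$|c_3(n)| \leq E[|X|^3\, |f_Y''(x_0)| \cdot \mathbb{1}\{|X| < \delta\sqrt{n}\}] \tag{88}$$

$$\leq k_2\, E[|X|^3]. \tag{89}$$

Here, (89) follows because $|f_Y''(x_0)| \leq k_2$ by assumption.
Combining (76) and (80), and using that $E[X^2] = 1$ and
$P[Y \geq 0] = 1 - F_Y(0)$, we obtain

$$n^{3/2} \left| P[X \leq \sqrt{n}\, Y] - P[Y \geq 0] + \frac{f_Y'(0)}{2n} \right| \leq n^{3/2} \left( \int_{|x| \geq \delta\sqrt{n}} P[Y \geq x/\sqrt{n}]\, dP_X + (1 - F_Y(0))\, P[|X| \geq \delta\sqrt{n}] + \left| -\frac{f_Y(0)}{\sqrt{n}} c_1(n) + \frac{f_Y'(0)}{2n} c_2(n) - \frac{1}{6 n^{3/2}} c_3(n) \right| \right) \tag{90}$$

$$\leq \delta^{-3} E[|X|^3] + \delta^{-3} E[|X|^3] + \frac{k_2\, E[|X|^3]}{6} + k_2 \underbrace{\left( \delta^{-2} + (2\delta)^{-1} \right) E[|X|^3 \cdot \mathbb{1}\{|X| \geq \delta\sqrt{n}\}]}_{\triangleq\, c_4(n)} \tag{91}$$

$$= \left( k_2/6 + 2\delta^{-3} \right) E[|X|^3] + k_2\, c_4(n) \tag{92}$$

where (91) follows from (78), (85), (87), and (89). The proof is
concluded by taking $n \to \infty$ on each side of (92), and by
using that, since $E[|X|^3] < \infty$,

$$\lim_{n \to \infty} c_4(n) = 0. \tag{93}$$

Appendix D
Proof of Theorem 5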



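To establish Theorem 5 we study the converse bound (19)
and the achievability bound (30) in the large-$n$ limit.

A. Converse

We begin by upper-bounding the RHS of (19) by recalling
that for every $\gamma > 0$ [5, Eq. (102)]

$$\alpha \leq P\left[ \frac{dP}{dQ} \geq \gamma \right] + \gamma\, \beta_\alpha(P, Q). \tag{94}$$

Using (94) on (63) and setting $\gamma = e^{n\gamma_n}$ we obtain

$$\beta_{1-\epsilon}(P_0, Q_0) \geq e^{-n\gamma_n} \left( 1 - \epsilon - P_0[r(\mathbf{x}_0; \mathbb{Y}\mathbf{H}) \geq n\gamma_n] \right) \tag{95}$$

$$= e^{-n\gamma_n} \left( P_0[r(\mathbf{x}_0; \mathbb{Y}\mathbf{H}) \leq n\gamma_n] - \epsilon \right). \tag{96}$$

This allows us to upper-bound the RHS of (19) as

$$R^*_{\mathrm{rt}}(n - 1, \epsilon) \leq \frac{n}{n - 1} \left( \gamma_n - \frac{1}{n} \log\left( P[S_n \leq n\gamma_n] - \epsilon \right) \right) \tag{97}$$

for every $\gamma_n$ satisfying

$$P[S_n \leq n\gamma_n] > \epsilon. \tag{98}$$

We shall take $\gamma_n$ so that

$$P[S_n \leq n\gamma_n] = \epsilon + \frac{1}{n}. \tag{99}$$

For this choice of $\gamma_n$, (97) reduces to

$$R^*_{\mathrm{rt}}(n - 1, \epsilon) \leq \frac{n}{n - 1} \left( \gamma_n + \frac{\log n}{n} \right). \tag{100}$$

The proof is completed by showing that (99) holds for

$$\gamma_n = C_\epsilon + O(1/n). \tag{101}$$

To prove (101), we evaluate $P[S_n \leq n\gamma_n]$ in the limit
$n \to \infty$ up to a $o(1/n)$ term. Note that, given $G$, the random
variable $S_n$ is the sum of $n$ i.i.d. random variables with mean
$\mu(G) \triangleq \log(1 + \rho G)$ and variance

$$\sigma^2(G) \triangleq \frac{\rho G (\rho G + 2)}{(1 + \rho G)^2}. \tag{102}$$

Hence,

$$P[S_n \leq n\xi] = P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} T_j \leq \sqrt{n}\, U(\xi) \right] \tag{103}$$

where

$$T_j \triangleq \frac{1}{\sigma(G)} \left( 1 - \frac{\left| \sqrt{\rho G}\, Z_j - 1 \right|^2}{1 + \rho G} \right) \tag{104}$$

are zero-mean, unit-variance random variables that are
conditionally independent given $G$, and

$$U(\xi) \triangleq \frac{\xi - \mu(G)}{\sigma(G)}. \tag{105}$$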

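The following lemma, which is based on a Cramér-Esseen-type
central-limit theorem [8, Thm. VI.1] and on Lemma 4,
shows that (103) can be closely approximated by $P[U(\xi) \geq 0]$.

Footnote: We shall write $U(\xi)$ simply as $U$ whenever stressing its dependence on
$\xi$ is unnecessary.

Lemma 6: Let $\{T_j\}_{j=1}^{n}$ be given in (104) and let $U(\xi)$ be
given in (105) with $G$ satisfying the assumptions in Theorem 5.
Take an arbitrary $\xi_0 > 0$ that satisfies $P[U(\xi_0) \geq 0] > 0$. Then
there exists a $\delta > 0$ so that

$$\limsup_{n \to \infty} \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} n^{3/2} \left| P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} T_j \leq \sqrt{n}\, U(\xi) \right] - P[U(\xi) \geq 0] + \frac{q(\xi)}{2n} \right| < \infty \tag{106}$$

where

$$q(\xi) \triangleq f_{U(\xi)}'(0) \tag{107}$$

$$= -\frac{e^{2\xi} - 1}{\rho^2}\, f_G'\!\left( \frac{e^{\xi} - 1}{\rho} \right) - \frac{e^{2\xi} + 1}{\rho\, e^{\xi}}\, f_G\!\left( \frac{e^{\xi} - 1}{\rho} \right). \tag{108}$$

Proof: See Appendix E.

Note that

$$P[U(\xi) \geq 0] = P[\mu(G) \leq \xi] = F_C(\xi) \tag{109}$$

where $F_C(\xi)$ is defined in (15). Hence, setting $\xi = \gamma_n$ and
$\xi_0 = C_\epsilon$, we get

$$P[S_n \leq n\gamma_n] = F_C(\gamma_n) - \frac{q(\gamma_n)}{2n} + O(n^{-3/2}) \tag{110}$$

where $O(n^{-3/2})$ is uniform in $\gamma_n \in (C_\epsilon - \delta, C_\epsilon + \delta)$ for some
$\delta > 0$. Substituting (110) into (99), we finally obtain

$$F_C(\gamma_n) = \epsilon + \frac{1}{n} + \frac{q(\gamma_n)}{2n} + O(n^{-3/2}). \tag{111}$$

By Taylor's theorem [14, Thm. 5.15]

$$F_C(\gamma_n) = F_C(C_\epsilon) + \left( F_C'(C_\epsilon) + o(1) \right)(\gamma_n - C_\epsilon). \tag{112}$$

Substituting (112) into (111) and using that $F_C(C_\epsilon) = \epsilon$, we
get

$$\gamma_n = C_\epsilon + \frac{q(C_\epsilon) + 2}{2 F_C'(C_\epsilon)} \cdot \frac{1}{n} + o(1/n). \tag{113}$$

The proof of (101) is concluded by noting that, by assumption,
$q(C_\epsilon) < \infty$ and $F_C'(C_\epsilon) > 0$.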

B. Achievability 

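We set $\tau = 1/n$ and $\gamma_n = \exp(-C_\epsilon + O(1/n))$ in (30) and
we use that

$$F(\gamma_n; n - r, r) = \frac{\Gamma(n)}{\Gamma(n - r)\Gamma(r)} \int_0^{\gamma_n} t^{(n-r)-1} (1 - t)^{r-1}\, dt \tag{114}$$

$$\leq \frac{\Gamma(n)}{\Gamma(n - r + 1)\Gamma(r)}\, \gamma_n^{\,n-r} \tag{115}$$

$$\leq n^{r-1} \gamma_n^{\,n-r}. \tag{116}$$

This yields

$$\frac{\log M}{n} \geq C_\epsilon - r\, \frac{\log(n)}{n} + O\!\left( \frac{1}{n} \right). \tag{117}$$

To conclude the proof, we show that the choice $\gamma_n =
\exp(-C_\epsilon + O(1/n))$ satisfies

$$P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0}[Z_{\mathbf{x}_0}(\mathbb{Y}) = 1] \geq 1 - \epsilon + 1/n. \tag{118}$$

Given $\mathbf{H} = \mathbf{h} \neq \mathbf{0}$, we have that

$$\cos\theta(\mathbf{x}_0, \mathbb{Y}) = \max_{\mathbf{a} \in \mathbb{C}^r \setminus \{\mathbf{0}\}} \frac{|\langle \mathbf{x}_0, \mathbb{Y}\mathbf{a} \rangle|}{\|\mathbf{x}_0\|\, \|\mathbb{Y}\mathbf{a}\|} \tag{119}$$

$$\geq \frac{|\langle \mathbf{x}_0, \mathbb{Y}\bar{\mathbf{h}} \rangle|}{\|\mathbf{x}_0\|\, \|\mathbb{Y}\bar{\mathbf{h}}\|}. \tag{120}$$

Footnote: Note that $\mathbf{H} = \mathbf{0}$ with zero probability.

Then

$$P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0}[\cos^2\theta(\mathbf{x}_0, \mathbb{Y}) \geq 1 - \gamma_n] = E_{\mathbf{H}}\left[ P_{\mathbb{Y}|\mathbf{H}=\mathbf{h},\mathbf{X}=\mathbf{x}_0}[\cos^2\theta(\mathbf{x}_0, \mathbb{Y}) \geq 1 - \gamma_n] \right] \tag{121}$$

$$\geq E_{\mathbf{H}}\left[ P_{\mathbb{Y}|\mathbf{H}=\mathbf{h},\mathbf{X}=\mathbf{x}_0}\left[ \frac{|\langle \mathbf{x}_0, \mathbb{Y}\bar{\mathbf{h}} \rangle|^2}{\|\mathbf{x}_0\|^2 \|\mathbb{Y}\bar{\mathbf{h}}\|^2} \geq 1 - \gamma_n \right] \right] \tag{122}$$

$$= P_{\mathbb{Y}\mathbf{H}|\mathbf{X}=\mathbf{x}_0}\left[ \frac{|\langle \mathbf{x}_0, \mathbb{Y}\bar{\mathbf{H}} \rangle|^2}{\|\mathbf{x}_0\|^2 \|\mathbb{Y}\bar{\mathbf{H}}\|^2} \geq 1 - \gamma_n \right]. \tag{123}$$

Under $P_{\mathbb{Y}\mathbf{H}|\mathbf{X}=\mathbf{x}_0}$, the term $|\langle \mathbf{x}_0, \mathbb{Y}\bar{\mathbf{H}} \rangle|^2 / (\|\mathbf{x}_0\|^2 \|\mathbb{Y}\bar{\mathbf{H}}\|^2)$
is distributed as

$$\frac{\left| \sqrt{n\rho}\, \|\mathbf{H}\|^2 + n^{-1/2} \sum_{j=1}^{n} \mathbf{W}_j^H \mathbf{H} \right|^2}{\sum_{j=1}^{n} \left| \sqrt{\rho}\, \|\mathbf{H}\|^2 + \mathbf{W}_j^H \mathbf{H} \right|^2} \tag{124}$$

where $\mathbf{W}_j \sim \mathcal{CN}(\mathbf{0}, \mathsf{I}_r)$. Note that $\mathbf{W}_j^H \mathbf{H}$ has the same
distribution as $\sqrt{G}\, Z_j$, where $Z_j \sim \mathcal{CN}(0, 1)$. Hence, the
random ratio in (124) is distributed as

$$\frac{\left| n^{-1/2} \sum_{j=1}^{n} \left( Z_j + \sqrt{G\rho} \right) \right|^2}{\sum_{j=1}^{n} \left| Z_j + \sqrt{G\rho} \right|^2}. \tag{125}$$

Therefore,

$$P_{\mathbb{Y}\mathbf{H}|\mathbf{X}=\mathbf{x}_0}\left[ \frac{|\langle \mathbf{x}_0, \mathbb{Y}\bar{\mathbf{H}} \rangle|^2}{\|\mathbf{x}_0\|^2 \|\mathbb{Y}\bar{\mathbf{H}}\|^2} \geq 1 - \gamma_n \right] = P\left[ \frac{\left| n^{-1/2} \sum_{j=1}^{n} (Z_j + \sqrt{G\rho}) \right|^2}{\sum_{j=1}^{n} |Z_j + \sqrt{G\rho}|^2} \geq 1 - \gamma_n \right] \tag{126}$$

$$= P\left[ \frac{\sum_{j=1}^{n} |Z_j + \sqrt{G\rho}|^2 - \left| n^{-1/2} \sum_{j=1}^{n} (Z_j + \sqrt{G\rho}) \right|^2}{\sum_{j=1}^{n} |Z_j + \sqrt{G\rho}|^2} \leq \gamma_n \right] \tag{127}$$

$$\geq P\left[ \frac{\sum_{j=1}^{n} |Z_j|^2}{\sum_{j=1}^{n} |Z_j + \sqrt{G\rho}|^2} \leq \gamma_n \right] \tag{128}$$

$$= P\left[ \sum_{j=1}^{n} \left| (1 - \gamma_n) Z_j - \gamma_n \sqrt{G\rho} \right|^2 \leq n \gamma_n G \rho \right] \tag{129}$$

$$= P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} \tilde{T}_j \leq \sqrt{n}\, \tilde{U} \right] \tag{130}$$

where

$$\tilde{U} \triangleq \frac{\gamma_n G \rho - \tilde\mu(G)}{\tilde\sigma(G)} = \frac{\gamma_n (1 + G\rho) - 1}{\sqrt{(1 - \gamma_n)^2 + 2\gamma_n^2 G \rho}} \tag{131}$$

and

$$\tilde{T}_j \triangleq \frac{1}{\tilde\sigma(G)} \left( \left| (1 - \gamma_n) Z_j - \gamma_n \sqrt{G\rho} \right|^2 - \tilde\mu(G) \right) \tag{132}$$

with

$$\tilde\mu(G) \triangleq (1 - \gamma_n)^2 + \gamma_n^2 G \rho \tag{133}$$

and

$$\tilde\sigma^2(G) \triangleq (1 - \gamma_n)^2 \left[ (1 - \gamma_n)^2 + 2\gamma_n^2 G \rho \right]. \tag{134}$$

Here, (128) holds because the numerator in (127) is invariant to a
common shift of the $Z_j$ and is therefore no larger than $\sum_j |Z_j|^2$.
Note that $\{\tilde{T}_j\}_{j=1}^{n}$ are zero-mean, unit-variance random
variables that are conditionally independent given $G$.

To summarize, we showed that

$$P_{\mathbb{Y}|\mathbf{X}=\mathbf{x}_0}[\cos^2\theta(\mathbf{x}_0, \mathbb{Y}) \geq 1 - \gamma_n] \geq P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} \tilde{T}_j \leq \sqrt{n}\, \tilde{U} \right]. \tag{135}$$

To conclude the proof, it suffices to show that $\gamma_n =
\exp(-C_\epsilon + O(1/n))$ yields

$$P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} \tilde{T}_j \leq \sqrt{n}\, \tilde{U} \right] \geq 1 - \epsilon + \frac{1}{n}. \tag{136}$$

To this end, we proceed along the lines of the converse proof
to obtain

$$P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} \tilde{T}_j \leq \sqrt{n}\, \tilde{U} \right] = P[\tilde{U} \geq 0] - \frac{\tilde{q}(\tilde\gamma_n)}{2n} + O(n^{-3/2}) \tag{137}$$

where

$$\tilde{q}(\tilde\gamma_n) \triangleq \frac{e^{2\tilde\gamma_n} - 1}{\rho^2}\, f_G'(\tilde{g}_0) + \frac{2}{\rho}\, f_G(\tilde{g}_0) \tag{138}$$

with $\tilde\gamma_n \triangleq -\log\gamma_n$ and $\tilde{g}_0 = (e^{\tilde\gamma_n} - 1)/\rho$, and where
$O(n^{-3/2})$ is uniform in $\tilde\gamma_n \in (C_\epsilon - \delta, C_\epsilon + \delta)$ for some $\delta > 0$.
We further have that

$$P[\tilde{U} \geq 0] = P[\log(1 + G\rho) \geq \tilde\gamma_n] = 1 - F_C(\tilde\gamma_n). \tag{139}$$

Substituting (138) and (139) into (137), and then (137) into
(136) taken with equality, we get

$$F_C(\tilde\gamma_n) = \epsilon - \frac{1}{n} - \frac{\tilde{q}(\tilde\gamma_n)}{2n} + O(n^{-3/2}). \tag{140}$$

Finally, using the same steps as in (111)-(113), we obtain

$$\tilde\gamma_n = C_\epsilon - \frac{\tilde{q}(C_\epsilon) + 2}{2 F_C'(C_\epsilon)} \cdot \frac{1}{n} + o(1/n) \tag{141}$$

$$= C_\epsilon + O(1/n) \tag{142}$$

where (142) follows because $\tilde{q}(C_\epsilon) < \infty$ and because
$F_C'(C_\epsilon) > 0$ by assumption. This concludes the proof.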



Appendix E 
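Proof of Lemma 6

Fix $\xi_0 > 0$ satisfying $P[U(\xi_0) \geq 0] > 0$. Observe that

$$P[U(\xi) \geq 0] = P[\log(1 + \rho G) \leq \xi] = F_C(\xi) \tag{143}$$

where $F_C(\xi)$ is defined in (15). Since $F_C(\xi)$ is continuous in
$\xi$, there exists $0 < \delta < \xi_0$ so that $F_C(\xi) > 0$ (and hence,
$P[U(\xi) \geq 0] > 0$) for every $\xi \in (\xi_0 - \delta, \xi_0 + \delta)$.

To establish Lemma 6 we will need the following version
of the Cramér-Esseen Theorem.

Footnote: The Berry-Esseen Theorem used in [5] to establish (1) yields asymptotic
expansions up to a $O(1/\sqrt{n})$ term. This is not sufficient here, since we need
to establish an asymptotic expansion up to a $o(1/n)$ term.

Theorem 7: Let $\{X_i\}_{i=1}^{n}$ be a sequence of i.i.d. real random
variables having zero mean and unit variance. Furthermore, let

$$v(t) \triangleq E[e^{itX_1}], \quad \text{and} \quad F_n(x) \triangleq P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} X_j \leq x \right]. \tag{144}$$

If $E[|X_1|^4] < \infty$ and if $\sup_{|t| \geq \zeta} |v(t)| \leq k_0$ for some $k_0 < 1$,
where $\zeta \triangleq 1/(12 E[|X_1|^3])$, then for all $x$ and $n$

$$\left| F_n(x) - Q(-x) - k_1 (1 - x^2) e^{-x^2/2} \frac{1}{\sqrt{n}} \right| \leq k_2 \left\{ n^{-1} (1 + |x|)^{-4} E[|X_1|^4] + n^6 \left( k_0 + \frac{1}{2n} \right)^n \right\}. \tag{145}$$

Here, $k_1 \triangleq E[X_1^3] / (6\sqrt{2\pi})$, and $k_2$ is a positive constant
independent of $\{X_i\}_{i=1}^{n}$ and $x$.

Proof: The inequality (145) is a consequence of the
tighter inequality reported in [8, Thm. VI.1].

To prove Lemma 6 we proceed as follows. Note that

$$P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} T_j \leq \sqrt{n}\, U \right] = E_G\left[ P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} T_j \leq \sqrt{n}\, U \,\Big|\, G \right] \right]. \tag{146}$$

We next estimate the conditional probability on the RHS
of (146) using Theorem 7. In order to do so, we
need to verify that there exists a $k_0 < 1$ such
that $\sup_{g \in \mathbb{R}^+} \sup_{|t| \geq \zeta} |v_{T_j}(t)| \leq k_0$, where $v_{T_j}(t) \triangleq
E[e^{itT_j} \,|\, G = g]$. We start by evaluating $\zeta$. For all $g \in \mathbb{R}^+$, it
can be shown that

$$E[|T_j|^4 \,|\, G = g] = \frac{9(\rho g)^2 + 36\rho g + 12}{(\rho g + 2)^2} \leq 15. \tag{147}$$

By Lyapunov's inequality [8, p. 18], this implies that

$$E[|T_j|^3 \,|\, G = g] \leq \left( E[|T_j|^4 \,|\, G = g] \right)^{3/4} \leq 15^{3/4}. \tag{148}$$

Hence,

$$\zeta = \frac{1}{12 E[|T_j|^3 \,|\, G = g]} \geq \frac{15^{-3/4}}{12} \triangleq \zeta_0. \tag{149}$$

By (149), we have that

$$\sup_{|t| \geq \zeta} |v_{T_j}(t)| \leq \sup_{|t| \geq \zeta_0} |v_{T_j}(t)| \tag{150}$$

where $\zeta_0$ does not depend on $g$. We now compute $|v_{T_j}(t)|$.
Observe that given $G = g$

$$T_j = \frac{1}{\sigma(g)} - \frac{\rho g}{2\sigma(g)(1 + \rho g)}\, N_j, \qquad N_j \triangleq \left| \sqrt{2}\, Z_j - \sqrt{2/(\rho g)} \right|^2 \tag{151}$$

where the term $N_j$ follows a noncentral $\chi^2$ distribution with
two degrees of freedom and noncentrality parameter $2/(\rho g)$.
If we let $v_{N_j}(t) \triangleq E[e^{itN_j}]$, then

$$|v_{T_j}(t)| = \left| \exp\left( \frac{it}{\sigma(g)} \right) \cdot E\left[ \exp\left( \frac{-it\rho g N_j}{2\sigma(g)(1 + \rho g)} \right) \right] \right| \tag{152}$$

$$= \left| v_{N_j}\!\left( \frac{-\rho g\, t}{2\sigma(g)(1 + \rho g)} \right) \right| \tag{153}$$

$$= \exp\left( -\frac{t^2}{\rho g t^2 + \rho g + 2} \right) \left( 1 + \frac{\rho g t^2}{\rho g + 2} \right)^{-1/2} \tag{154}$$

where (154) follows from [15, p. 24]. Now, observe that the
RHS of (154) is monotonically decreasing in $|t|$ and
monotonically increasing in $g$. Hence,

$$\sup_{g \in \mathbb{R}^+} \sup_{|t| \geq \zeta_0} |v_{T_j}(t)| \leq \sup_{g \in \mathbb{R}^+} \sup_{|t| \geq \zeta_0} \left\{ \exp\left( -\frac{t^2}{\rho g t^2 + \rho g + 2} \right) \left( 1 + \frac{\rho g t^2}{\rho g + 2} \right)^{-1/2} \right\} \tag{155}$$

$$\leq \sup_{g \in \mathbb{R}^+} \left\{ \exp\left( -\frac{\zeta_0^2}{\rho g \zeta_0^2 + \rho g + 2} \right) \left( 1 + \frac{\rho g \zeta_0^2}{\rho g + 2} \right)^{-1/2} \right\} \tag{156}$$

$$\leq \frac{1}{\sqrt{1 + \zeta_0^2}} < 1. \tag{157}$$

Set $k_0 = 1/\sqrt{1 + \zeta_0^2}$. As we verified that the conditions in
Theorem 7 are met, we conclude that for all $n$

$$\left| P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} T_j \leq \sqrt{n}\, U \right] - E[Q(-\sqrt{n}\, U)] \right| \leq \frac{k_3}{\sqrt{n}} \left| E\left[ (1 - nU^2) e^{-nU^2/2} \right] \right| + \frac{k_4}{n} E\left[ (1 + |\sqrt{n}\, U|)^{-4} \right] + k_2 n^6 \left( k_0 + \frac{1}{2n} \right)^n \tag{158}$$

where $k_3 \triangleq 15^{3/4}/(6\sqrt{2\pi})$ and $k_4 \triangleq 15 k_2$. In view of (106),
we note that the last term on the RHS of (158) satisfies

$$\lim_{n \to \infty} n^{3/2} \cdot n^6 \left( k_0 + \frac{1}{2n} \right)^n = 0. \tag{159}$$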

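Next, we prove the following two estimates

$$\limsup_{n \to \infty} \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} \sqrt{n}\, E\left[ \left( 1 + |\sqrt{n}\, U(\xi)| \right)^{-4} \right] \leq k_5 \tag{160}$$

$$\limsup_{n \to \infty} \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} n \left| E\left[ \left( 1 - n (U(\xi))^2 \right) e^{-n (U(\xi))^2 / 2} \right] \right| \leq k_6 \tag{161}$$

for some constants $k_5, k_6 < \infty$. Note that since the map

$$(\xi, g) \mapsto \left( \xi, \frac{\xi - \mu(g)}{\sigma(g)} \right) \tag{162}$$

is a diffeomorphism (of class $C^3$) [16, p. 147] in the region
$\xi > 0$, $g > 0$, the pdf $f_{U(\xi)}(t)$ of $U(\xi)$ and its first and second
derivative are jointly continuous functions of $(\xi, t)$, and, hence,
bounded on bounded sets. Specifically, for the interval
$(\xi_0 - \delta, \xi_0 + \delta)$ and every $\tilde\delta > 0$ there exists a $k < \infty$ so that

$$\sup_{t \in [-\tilde\delta, \tilde\delta]}\ \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} |f_{U(\xi)}(t)| \leq k \tag{163}$$

$$\sup_{t \in [-\tilde\delta, \tilde\delta]}\ \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} |f_{U(\xi)}'(t)| \leq k \tag{164}$$

$$\sup_{t \in [-\tilde\delta, \tilde\delta]}\ \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} |f_{U(\xi)}''(t)| \leq k. \tag{165}$$

Fix now $\tilde\delta > 0$ and let $k$ be as in (163)-(165). To prove (160),
we proceed as follows:

$$E\left[ (1 + |\sqrt{n}\, U|)^{-4} \right] = E\left[ (1 + |\sqrt{n}\, U|)^{-4}\, \mathbb{1}\{|U| < \tilde\delta\} \right] + E\left[ (1 + |\sqrt{n}\, U|)^{-4}\, \mathbb{1}\{|U| \geq \tilde\delta\} \right] \tag{166}$$

$$\leq 2k \int_0^{\tilde\delta} (1 + \sqrt{n}\, t)^{-4}\, dt + (1 + \sqrt{n}\, \tilde\delta)^{-4} \tag{167}$$

$$= \frac{2k}{3\sqrt{n}} \left[ 1 - (1 + \sqrt{n}\, \tilde\delta)^{-3} \right] + (1 + \sqrt{n}\, \tilde\delta)^{-4} \tag{168}$$

$$\leq \frac{2k}{3} \cdot \frac{1}{\sqrt{n}} + (1 + \sqrt{n}\, \tilde\delta)^{-4} \tag{169}$$

where in (167) we used (163). This proves (160). The
inequality (161) can be established as follows. First, for $n > \tilde\delta^{-2}$,

$$\left| E\left[ (1 - nU^2) e^{-nU^2/2} \right] \right| \leq \underbrace{\left| E\left[ (1 - nU^2) e^{-nU^2/2}\, \mathbb{1}\{|U| < \tilde\delta\} \right] \right|}_{\triangleq\, I_1} + \underbrace{E\left[ (nU^2 - 1) e^{-nU^2/2}\, \mathbb{1}\{|U| \geq \tilde\delta\} \right]}_{\triangleq\, I_2}. \tag{170}$$

To evaluate $I_1$, we use the relation $(1 - nt^2) e^{-nt^2/2} =
\frac{d}{dt}\left( t e^{-nt^2/2} \right)$ and integration by parts to obtain

$$I_1 = \left| \left[ t e^{-nt^2/2} f_U(t) \right]_{-\tilde\delta}^{\tilde\delta} - \int_{-\tilde\delta}^{\tilde\delta} t e^{-nt^2/2} f_U'(t)\, dt \right| \tag{171}$$

$$\leq 2k\tilde\delta\, e^{-n\tilde\delta^2/2} + 2k\, \frac{1}{n} \left( 1 - e^{-n\tilde\delta^2/2} \right). \tag{172}$$

Therefore,

$$\limsup_{n \to \infty}\, n I_1 \leq 2k. \tag{173}$$

For $I_2$ we proceed as follows:

$$I_2 \leq E\left[ nU^2 e^{-nU^2/2}\, \mathbb{1}\{|U| \geq \tilde\delta\} \right] \tag{174}$$

$$\leq \sup_{|t| \geq \tilde\delta} \left\{ nt^2 e^{-nt^2/2} \right\}. \tag{175}$$

Note that when $n > 2\tilde\delta^{-2}$, the function $nt^2 e^{-nt^2/2}$ is
monotonically decreasing in $t \in [\tilde\delta, +\infty)$. Hence,

$$\limsup_{n \to \infty}\, n I_2 \leq \lim_{n \to \infty} n^2 \tilde\delta^2 e^{-n\tilde\delta^2/2} = 0. \tag{176}$$

Substituting (173) and (176) into (170), we obtain (161).
Combining (160) and (161) with (158), we conclude that

$$\limsup_{n \to \infty} \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} n^{3/2} \left| P\left[ \frac{1}{\sqrt{n}} \sum_{j=1}^{n} T_j \leq \sqrt{n}\, U(\xi) \right] - E[Q(-\sqrt{n}\, U(\xi))] \right| \leq k_5 + k_6. \tag{177}$$

To conclude the proof of Lemma 6 we need to show that there
exists a constant $k_7 < \infty$ such that

$$\limsup_{n \to \infty} \sup_{\xi \in (\xi_0 - \delta, \xi_0 + \delta)} n^{3/2} \left| E[Q(-\sqrt{n}\, U(\xi))] - P[U(\xi) \geq 0] + \frac{f_{U(\xi)}'(0)}{2n} \right| \leq k_7 \tag{178}$$

where $f_{U(\xi)}$ is the pdf of $U(\xi)$. This follows by the uniform
bounds (163)-(165), and by (92). Note, in fact, that the term
$c_4(n)$ in the proof of Lemma 4, when evaluated for $Y = U(\xi)$,
does not depend on $\xi$.

Since $U(\xi) = (\xi - \mu(G))/\sigma(G)$, we get after algebraic
manipulations

$$q(\xi) = f_{U(\xi)}'(0) \tag{179}$$

$$= -\frac{e^{2\xi} - 1}{\rho^2}\, f_G'\!\left( \frac{e^{\xi} - 1}{\rho} \right) - \frac{e^{2\xi} + 1}{\rho\, e^{\xi}}\, f_G\!\left( \frac{e^{\xi} - 1}{\rho} \right). \tag{180}$$

This concludes the proof.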